Hebrew Vowel Restoration With Neural Networks
نویسندگان
چکیده
Modern Hebrew is written without vowels, presenting a problem for those wishing to carry out lexical analysis on Hebrew texts. Although fluent speakers can easily replace vowels when reading or speaking from a text, there are no simple rules that would allow for this task to be easily automated. Previous work in this field has involved using statistical methods to try to solve this problem. Instead we use neural networks, in which letter and morphology information are fed into a network as input and the output is the proposed vowel placement. Using a publicly available Hebrew corpus containing vowel and morphological tagging, we were able to restore 85% of the correct vowels to our test set. We achieved an 87% success rate for restoring the correct phonetic value for each letter. While our results do not compare favorably to previous results, we believe that, with further experimentation, our connectionist approach could be made viable.
منابع مشابه
An HMM Approach to Vowel Restoration in Arabic and Hebrew
Semitic languages pose a problem to Natural Language Processing since most of the vowels are omitted from written prose, resulting in considerable ambiguity at the word level. However, while reading text, native speakers can generally vocalize each word based on their familiarity with the lexicon and the context of the word. Methods for vowel restoration in previous work involving morphological...
متن کاملMany ways to read your vowels - Neural processing of diacritics and vowel letters in Hebrew
The current study examined the effect of orthographic transparency and familiarity on brain mechanisms involved in word recognition in adult Hebrew readers. We compared the effects of diacritics that provide transparent but less familiar information and vowel letters that increase orthographic transparency without compromising familiarity. Brain activation was measured in 18 adults during oral ...
متن کاملLanguage Support A Simple Technique for Typesetting Hebrew with Vowel Points
This paper describes a simple mechanism for typesetting Hebrew with vowel points. Hebrew uses a large set of accents that represent vowels, consonant modifiers, and cantillation instructions. These accents are placed above, below, or inside letters; a single letter can carry several accents. The solution that we describe, which is designed for PostScript [2] output devices, leaves the placement...
متن کاملVowel reduction in Modern Hebrew: Traces of the past and current variation
The aim of this paper was to find out the scope and boundaries of a-reduction in Modern Hebrew. In Classical Hebrew, vowel reduction was a regular, obligatory process. In Modern Hebrew, it has restricted scope and operates under opaque conditions. The only reliable trace of the historical motivation for the rule is the Hebrew vocalization system (nikud). 100 participants in four age groups were...
متن کاملRemarks on the Development of Some Pronominal Suffixes in Hebrew
The paradigmatic pressure for the preservation of the final vowels of pronominal suffixes after long vowels, where gender opposition could not be marked by the preceding vowel, was strong enough to create in rabbinic Hebrew, in Aramaic, and in Arabic, dialect doublets, viz., suffixes without final vowel after originally short vowels (as rabbinic Hebrew yadak 'your hand'), and those with final v...
متن کامل